Experiments on speaker profile portability

نویسندگان

  • Vincent Barreaud
  • Douglas D. O'Shaughnessy
  • Jean-Guy Dahan
چکیده

This paper addresses the problem of speaker characterization in the speaker-dependent speech recognition problem. Speaker Adaptation and Normalization techniques are designed to reduce the mismatch introduced by inter-speaker variability. Yet there is another source of mismatch introduced by intra-speaker variability. Indeed, the speaking style of a speaker depends on the nature of the speech uttered. The framework of this paper is speaker-dependent isolated-word recognition on an embedded engine. The limited computational and memory loads of this engine reduce the possible techniques for normalization. The proposed solution uses a speaker profile trained on dictation data and exported to the embedded engine. In this framework we study the portability of a task-dependent speaker profile from dictation task to command task. Experiments have been conducted on a Scansoft 255 speakers database. We show that the portability results in a loss of efficiency due to the nature of the considered tasks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Overview of Automatic Speaker Diarisation Systems

Audio diarisation is the process of annotating an input audio channel with information that attributes (possibly overlapping) temporal regions of signal energy to their specific sources. These sources can include particular speakers, music, background noise sources and other signal source/channel characteristics. Diarisation can be used for helping speech recognition, facilitating the searching...

متن کامل

Static, dynamic, and relational properties in vowel perception.

The present work reviews theories and empirical findings, including results from two new experiments, that bear on the perception of English vowels, with an emphasis on the comparison of data analytic "machine recognition" approaches with results from speech perception experiments. Two major sources of variability (viz., speaker differences and consonantal context effects) are addressed from th...

متن کامل

Tree-based estimation of speaker characteristics for speech recognition

Speaker adaptation by means of adjustment of speaker characteristic properties, such as vocal tract length, has the important advantage compared to conventional adaptation techniques that the adapted models are guaranteed to be realistic if the description of the properties are. One problem with this approach is that the search procedure to estimate them is computationally heavy. We address the...

متن کامل

Estimating speaker characteristics for speech recogni- tion

A speaker-characteristic-based hierarchic tree of speech recognition models is designed. The leaves of the tree contain model sets, which are created by transforming a conventionally trained set using leaf-specific speaker profile vectors. The non-leaf models are formed by merging the models of their child nodes. During recognition, a maximum likelihood criterion is followed to traverse the tre...

متن کامل

One DODer's View of ARPA Spoken Language Directions

DOD support for ARPA/HLT speech research stems from the belief that large vocabulary continuous speech recognition and speech understanding will have advantages in portability, and in the variety of applications sustainable from a single, basic speech system. With some imagination one can envision applications in such remotely related areas as speaker identification and language identification....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005